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Abstract 

We investigate the space efficiency of a Propositional Knowledge Representation (PKR) 
formalism. Intuitively, the space efficiency of a formalism F in representing a certain piece 
of knowledge a, is the size of the shortest formula of F that represents a. In this paper we 
assume that knowledge is either a set of propositional interpretations (models) or a set of 
propositional formulae (theorems) . We provide a formal way of talking about the relative 
ability of PKR formalisms to compactly represent a set of models or a set of theorems. We 
introduce two new compactness measures, the corresponding classes, and show that the 
relative space efficiency of a PKR formalism in representing models/theorems is directly 
related to such classes. In particular, we consider formalisms for nonmonotonic reasoning, 
such as circumscription and default logic, as well as belief revision operators and the stable 
model semantics for logic programs with negation. One interesting result is that formalisms 
with the same time complexity do not necessarily belong to the same space efficiency class. 



1. Introduction 

During the last years a large number of formalisms for knowledge representation (KR) have 
been proposed in the literature. Such formalisms have been studied from several perspec- 
tives, including semantical properties, and computational complexity. Here we investigate 
space efficiency, a property that has to do with the minimal size needed to represent a cer- 
tain piece of knowledge in a given formalism. This study is motivated by the fact that the 
same piece of knowledge can be represented by two formalisms using a different amount of 
space. Therefore, all else remaining the same, a formalism could be preferred over another 
one because it needs less space to store information. 

The definition of space efficiency, however, is not simple. Indeed, a formalism may allow 
several different ways to represent the same piece of knowledge. For example, let us assume 
that we want to represent the piece of knowledge "today is Monday". In Propositional 
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Logic we may decide to use a single propositional variable monday. The fact that today is 
Monday can be represented by the formula monday, but also by the formula - ^monday, 
as well as monday A (rain V -'rain), because all formulae of the Propositional Logic that 
are logically equivalent to monday represent exactly the same information. 

In Propositional Logic, we should consider the shortest of the equivalent formulae used 
to represent the information we have. The same principle can be applied to a generic 
formalism: if it allows several formulae to represent the same information, then we only take 
into account the shortest one. Therefore, we say that the space efficiency of a formalism F 
in representing a certain piece of knowledge a is the size of the shortest formula of F that 
represents a. Space efficiency — also called succinctness or compactness — of a formalism 
is a measure of its ability in representing knowledge in a small amount of space. 

In this paper we focus on propositional KR (PKR) formalisms. We do not give a 
formal definition of which formalisms are propositional and which one are not: intuitively, 
in a propositional formalism, quantifications are not allowed, and thus the formulae are 
syntactically bounded to be formed only using propositional connectives, plus some other 
kind of nonclassical connectives (for instance, negation in logic programs, etc.). 

So far, we have not discussed what knowledge represents. A possible way to think of a 
piece of knowledge is that it represents all facts that can be inferred from it. In other words, 
knowing something is the same as knowing everything that can be logically implied. The 
second way — which is in some cases more natural — is to think of a piece of knowledge 
as the set of states of the world that we consider possible. 

In a more formal way, we say that knowledge is represented either by a set of proposi- 
tional interpretations (those describing states of the world we consider plausible) or a set 
of formulae (those implied from what we know). Consequently, we focus on both reasoning 
problems of model checking and theorem proving. The following example shows that we 
can really think of knowledge in both ways. 

Example 1 We want to eat in a fast food, and want to have either a sandwich or a salad 
(but not both), and either water or coke (but not both). 

In Propositional Logic, each choice can be represented as a model, and the following 
models represent all possible choices (models are represented by writing down only the letters 
mapped to true). 

A = {{sandwich, water}, {sandwich, coke}, {salad, water}, {salad, coke}} 

For representing the set of choices we can use formulae instead of models. In this case, 
we write down a set of formulae whose models represent exactly the allowed choices, as 
follows. 

C = (sandwich V salad) A (^sandwich V ^salad) A (sandwich — > ^salad) A 
(water V coke) A (^waterV ^coke) A (^coke — ► water) 

Actually, we can get rid of redundancies, and end up with the following formula. 
F = (sandwich V salad) A (^sandwich V ^salad) A (water V coke) A (^waterV ^coke) 
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More formally, F represents the set of models A, because for each interpretation I, I £ A 
holds if and only if I \= F. The formula F also represents the set of formulae C, because 
Cn(F) = Cn(C), where Cn(.) is the function that gives the set of all conclusions that can 
be drawn from a propositional formula. 

1.1 State of the Art 

A question that has been deeply investigated, and is related to space efficiency, is the 
possibility of translating a formula expressed in one formalism into a formula expressed in 
another formalism (under the assumption, of course, that these formulae represent the same 
knowledge) . 

In most cases, the analysis is about the possibility of translating formulae from different 
formalisms to Propositional Logic (PL). For example, Ben-Eliyahu and Dechter (1991, 1994) 
proposed a translation from default logic to PL, and a translation from disjunctive logic 
programs to PL, while Winslett (1989) introduced a translation from revised knowledge 
bases to PL, and Gelfond, Przymusinska, and Przymusinskyi (1989) defined a translation 
from circumscription to PL. 

All the above translations, as well as many other ones in the literature, lead to an 
exponential increase of the size of the formula, in the worst case. When the best known 
translation yields a formula in the target formalism which has exponential size w.r.t. the 
formula in the source formalism, a natural question arising is whether such exponential 
blow up is due to the specific translation, or is intrinsic of the problem. For example, 
although all proposed translations from default logic to PL lead to the exponential blow 
up, we cannot conclude that all possible translations suffer from this problem: it could be 
that a polynomial translation exists, but it has not discovered so far. 

Some works have focussed on the question of whether this kind of exponential increase 
in the size is intrinsic or not. Cadoli, Donini, and Schaerf (1996) have shown that many in- 
teresting fragments of default logic and circumscription cannot be expressed by polynomial- 
time fragments of PL without super-polynomially increasing the size of formulae. It has 
been proved that such a super-polynomial increase of size is necessary when translating 
unrestricted propositional circumscription (Cadoli, Donini, Schaerf, & Silvestri, 1997) and 
most operators for belief revision into PL (Cadoli, Donini, Liberatore, & Schaerf, 1999; 
Liberatore, 1995). 

Gogic and collegues (1995) analyzed the relative succinctness of several PKR formalisms 
in representing sets of models. Among other results, they showed that skeptical default logic 
can represent sets of models more succinctly than circumscription. 

Kautz, Kearns, and Selman (1995) and Khardon and Roth (1996, 1997) considered 
representations of knowledge bases based on the notion of characteristic model, comparing 
them to other representations, e.g., based on clauses. They showed that the representation of 
knowledge bases with their characteristic models is sometimes exponentially more compact 
than other ones, and that the converse is true in other cases. 

However, all the above results are based on specific proofs, tailored to a specific re- 
duction, and do not help us to define equivalence classes for the space efficiency of KR 
formalisms. In a recent paper (Cadoli, Donini, Liberatore, & Schaerf, 1996b), a new com- 
plexity measure for decision problems, called compilability, has been introduced. In the 



3 



CADOLI, DONINI, LlBERATORE, & SCHAERF 



present paper we show how this new measure can be directly used to characterize the space 
efficiency of PKR formalisms. We emphasize methodological aspects, expressing in a more 
general context many of the results presented before. 

1.2 Goal 

The notion of polynomial time complexity has a great importance in KR (as well as many 
other fields of computer science), as problems that can be solved in polynomial time are to 
be considered easy, from a computational point of view. 

The notion of polynomial many-one reducibility also has a very intuitive meaning when 
applied to KR: if there exists a polynomial many-one reduction from one formalism to 
another one, then the time complexity of reasoning in the two formalisms is comparable. 
This allows to say, e.g., that inference in PL is coNP-complete, i.e. it is one of the hardest 
problems among those in the complexity class coNP. 

As a result, we have a formal tool for comparing the difficulty of reasoning in two 
formalisms. What is missing is a way for saying that one formalism is able to represent the 
same information in less space. 

Example 2 We consider again the lunch scenario of the previous example. We show that 
we can reduce the size of the representation using circumscription instead of Propositional 
Logic. In PL, the knowledge of the previous example was represented by the formula F: 

F = (sandwichV salad) A (-^sandwichV -^salad) A (waterV coke) A (-^waterV ^coke) 

The set of models of this formula is A, and the models of A are exactly the minimal 
models of the formula F c defined as follows. 

F c = (sandwich V salad) A (water V coke) 

By the definition of circumscription (McCarthy, 1980) it holds that F is equivalent to 
CIRC(F C ; {sandwich, salad, water, coke}, 0, 0). Note that F c is shorter than F. If this result 
can be proved to hold for arbitrary sets of models, we may conclude that circumscription is 
more space efficient than Propositional Logic in representing knowledge expressed as sets of 
models. 

Our goal is to provide a formal way of talking about the relative ability of PKR for- 
malisms to compactly represent information, where the information is either a set of models 
or a set of theorems. In particular, we would like to be able to say that a specific PKR 
formalism provides "one of the most compact ways to represent models/theorems" among 
the PKR formalisms of a specific class. 

1.3 Results 

We introduce two new compactness measures (model and theorem compactness) and the 
corresponding classes (model-C and thm-C, where C is a complexity class like P, NP, coNP, 
etc.). Such classes form two hierarchies that are isomorphic to the polynomial-time hierarchy 
(Stockmeyer, 1976). We show that the relative space efficiency of a PKR formalism is 
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directly related to such classes. In particular, the ability of a PKR formalism to compactly 
represent sets of models/theorems is directly related to the class of the model/theorem 
hierarchy it belongs to. Problems higher up in the model/theorem hierarchy can represent 
sets of models/theorems more compactly than formalisms that are in lower classes. 

This classification is obtained through a general framework and not by making direct 
comparisons and specific translations between the various PKR formalisms. Furthermore, 
our approach also allows for a simple and intuitive notion of completeness for both model 
and theorem hierarchies. This notion precisely characterizes both the relation between 
formalisms at different levels, and the relations between formalisms at the same level. An 
interesting result is that two PKR formalisms in which model checking or inference belong 
to the same time complexity class may belong to different compactness classes. This may 
suggest a criterion for choosing between two PKR formalisms in which reasoning has the 
same time complexity — namely, choose the more compact one. Also, two PKR formalisms 
may belong to the same theorem compactness class, yet to different model compactness 
classes. This stresses the importance of clarifying whether one wants to represent models 
or theorems when choosing a PKR formalism. 

1.4 Outline 

In the next section we introduce the notation and the assumptions that we adopt in this 
work. In Section 3 (Compilability) we briefly recall some notions on non-uniform computa- 
tion that are important for what follows and we recall the basic definitions of compilability 
classes (Cadoli et al., 1996b). In Section 4 (Reductions) we describe the constraints we 
impose on reductions, while in Section 5 (Space Efficiency) we introduce our compactness 
classes. In Section 6 (Applications) we actually compare many known PKR formalisms 
using our framework. Finally, in Section 7 (Related Work and Conclusions) we compare 
our work with other proposals presented in the literature and draw some conclusions. 

2. Notations and Assumptions 

In this section we define what knowledge bases and formalisms are. Since we want to 
consider formalisms that are very different both in syntax and in semantics, we need very 
general definitions. Let us consider, as a base case, the formalism of propositional calculus. 
Formally, we can assume that it is composed of three parts: 

1. a syntax, which is used to define the well-formed formulae; 

2. a proof theory, which allows for saying when a formula follows from another one; and 

3. a model-theoretic semantics, which establishes when a model satisfies a formula. 

The syntax is defined from a finite alphabet of propositional symbols L = {a, b, c, . . .}, 
possibly with subscripts, and the usual set of propositional connectives A, V, ->. 

In terms of knowledge representation, the proof theory can be seen as a way for extract- 
ing knowledge from a knowledge base. For example, if our knowledge base is a Ac, then the 
fact a V b holds. We can thus say that the formula a V b is part of the knowledge represented 
by a Ac. 
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In some cases, we want knowledge bases to represent models rather than sets of formulas. 
An interpretation for an alphabet of propositional variables L is a mapping from L in 
{true, false}. The model-theoretic semantics of the propositional calculus is the usual way 
of extending an interpretation for L to well-formed formulas. 

Let us now extend such definition to generic formalisms: a formalism is composed of a 
syntax, a proof theory, and a model-theoretic semantics. 

We remark that each formalism has its own syntax: for instance, default logic includes 
a ternary connective — '— for denoting default rules, while logic programming has a special 
unary connective not(), and so on. A knowledge base of a formalism F is simply a well- 
formed formula, according to the syntax of the formalism. 

Each formalism has its own proof theory as well. The proof theory of a formalism F is a 
binary relation hi? on the set of knowledge bases and formulae. Intuitively, FB \-p (j> means 
that 4> is a consequence of the knowledge base KB, according to the rules of the formalism 
F. As a result, the set of formulae 4> that are implied by a knowledge base KB is exactly 
the knowledge represented by KB. 

The base of a comparison between two different formalisms is a concept of equivalence, 
allowing for saying that two knowledge bases (of two different formalisms) represent the 
same piece of knowledge. Since the knowledge represented by a knowledge base is the set 
of formulas it implies, we have to assume that the syntax of these formulae is the same 
for all formalisms. Namely, we always assume that the formulae implied by a knowledge 
base are well-formed formulae of the propositional calculus. In other words, each formalism 
has a syntax for the knowledge bases: however, we assume that the proof theory relates 
knowledge bases (formulae in the syntax of the formalism) with propositional formulae. So, 
while writing KB \-p <f>, we assume that KB is a knowledge base in the syntax of F, while 
is a propositional formula. 

This allows for saying that two knowledge bases KB\ and KB2, expressed in two dif- 
ferent formalisms F\ and F2, represent the same piece of knowledge: this is true when, for 
any propositional formula <p it holds KB\ \-p 1 (f> if and only if KB2 Kf 2 4>- 

The model-theoreric semantics of a formalism is a relation \=p between propositional 
models and knowledge bases. In this case, we assume a fixed alphabet L, thus the set of 
all interpretations is common to all formalisms. When a model M and a knowledge base 
KB are in the relation, we write M \=p KB. Intuitively, this means that the model M 
supports the piece of knowledge represented by KB. 

We remark that some formalisms, e.g. credolous default logic (Reiter, 1980), have a 
proof theory, but do not have a model-theoretic semantics. It is also possible to conceive 
formalisms with a model-theoretic semantics but no proof theory. When both of them are 
defined, we assume that they are related by the following formula: 

KB hp (f) iff VI .1 \= KB implies I \= </> 

Regarding the proof theory of formalisms, we only consider formulae that are shorter 
than the knowledge base, that is, we assume that the knowledge represented by a knowlegde 
base KB is the set of formulae <fi such that KB hp 0, and the size of 4> is at most the size 
of KB. This is done for two reasons: first, formulas that are larger than KB are likely to 
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contain large parts that are actually independent from KB; second, we can give technicals 
result in a very simple way by using the compilability classes introduced in the next section. 

Assumption 1 We consider only formulae whose size is less than or equal to that of the 
knowledge base. 

All formalisms we consider satisfy the right-hand side distruibutivity of conjunction, 
that is, KB \-p 4> A fi if and only if KB \-p <p and KB \-p //. The assumption on the size 
of 4> is not restrictive in this case, if <j> is a CNF formula. 

3. Compilability Classes 

We assume the reader is familiar with basic complexity classes, such as P, NP and (uniform) 
classes of the polynomial hierarchy (Stockmeyer, 1976; Garey & Johnson, 1979). Here we 
just briefly introduce non-uniform classes (Johnson, 1990). In the sequel, C, C, etc. denote 
arbitrary classes of the polynomial hierarchy. 

We assume that the input instances of problems are strings built over an alphabet S. 
We denote with e the empty string and assume that the alphabet £ contains a special 
symbol # to denote blanks. The length of a string x € X* is denoted by \x\. 

Definition 1 An advice A is a function that takes an integer and returns a string. 

Advices are important in complexity theory because definitions and results are often 
based on special Turing machines that can determine the result of an oracle "for free" , that 
is, in constant time. 

Definition 2 An advice-taking Turing machine is a Turing machine enhanced with the 
possibility to determine A(\x\) in constant time, where x is the input string. 

Of course, the fact that can be determined in constant time (while A can be 

an intractable or even undecidable function) makes all definitions based on advice-taking 
Turing machine different from the same ones based on regular Turing machine. For example, 
an advice-taking Turing machine can calculate in polynomial time many functions that a 
regular Turing machine cannot (including some untractable ones). 

Note that the advice is only a function of the size of the input, not of the input itself. 
Hence, advice-taking Turing machines are closely related to non-uniform families of circuits 
(Boppana & Sipser, 1990). Clearly, if the advice were allowed to access the whole instance, 
it would be able to determine the solution of any problem in constant time. 

Definition 3 An advice-taking Turing machine uses polynomial advice if there exists a 
polynomial p such that the advice oracle A satisfies \A(n)\ < p(n) for any nonnegative 
integers n. 

The non-uniform complexity classes are based on advice-taking Turing machines. In 
this paper we consider a simplified definition, based on classes of the polynomial hierarchy. 
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Definition 4 If C is a class of the polynomial hierarchy, then C/poly is the class of lan- 
guages defined by Turing machines with the same time bounds as C, augmented by polyno- 
mial advice. 

Any class C/poly is also known as non-uniform C, where non-uniformity is due to the 
presence of the advice. Non-uniform and uniform complexity classes are related: Karp and 
Lipton (1980) proved that if NP C P/poly then Hi? = £2 = PH, i.e., the polynomial hier- 
archy collapses at the second level, while Yap (1983) generalized their results, in particular 
by showing that if NP C coNP/poly then II3 = £3 = PH, i.e., the polynomial hierarchy 
collapses at the third level. An inprovement of this results has been given by Kobler and 
Watanabe (1998): they proved that H p k C X^/poly implies that the polynomial hierar- 
chy collapses to ZPP(S^ +1 ). The collapse of the polynomial hierarchy is considered very 
unlikely by most researchers in structural complexity. 

We now summarize some definitions and results proposed to formalize the compilability 
of problems (Cadoli et al., 1996b), adapting them to the context and terminology of PKR 
formalisms. We remark that it is not the aim of this paper to give a formalization of 
compilability of problems, or to analyze problems from this point of view. Rather, we show 
how to use the compilability classes as a technical tool for proving results on the relative 
efficiency of formalisms in representing knowledge in little space. 

Several papers in the literature focus on the problem of reducing the complexity of 
problems via a preprocessing phase (Kautz & Selman, 1992; Kautz et al., 1995; Khardon 
Sz Roth, 1997). This motivates the introduction of a measure of complexity of problems 
assuming that such preprocessing is allowed. Following the intuition that a knowledge base 
is known well before questions are posed to it, we divide a reasoning problem into two parts: 
one part is fixed or accessible off-line (the knowledge base), and the second one is varying, or 
accessible on-line (the interpretation/formula). Compilability aims at capturing the on-line 
complexity of solving a problem composed of such inputs, i.e., complexity with respect to 
the second input when the first one can be preprocessed in an arbitrary way. In the next 
section we show the close connection between compilability and the space efficiency of PKR 
formalisms. 

A function / is called poly-size if there exists a polynomial p such that for all strings 
x it holds |/(x)| < p(|x|). An exception to this definition is when x represents a number: 
in this case, we impose \f(x)\ < p(x). As a result, we can say that the function A used in 
advice-taking turing machine is a polysize function. 

A function g is called poly-time if there exists a polynomial q such that for all x, g(x) 
can be computed in time less than or equal to ^(|xj). These definitions easily extend to 
binary functions as usual. 

We define a language of pairs S as a subset of E* x £*. This is necessary to represent 
the two inputs to a PKR reasoning problem, i.e., the knowledge base (KB), and the formula 
or interpretation. As an example, the problem of Inference in Propositional Logic (pli) is 
defined as follows. 

pli = {{x,y) I x is a set of propositional formulae (the KB), y is a formula, and x \- y} 

It is well known that pli is coNP-complete, i.e., it is one of the "hardest" problems 
among those belonging to coNP. Our goal is to prove that PLI is the "hardest" theorem- 



8 



Space Efficiency of Propositional Knowledge Representation Formalisms 



proving problem among those in coNP that can be solved by preprocessing the first input 
in an arbitrary way, i.e., the KB. To this end, we introduce a new hierarchy of classes, the 
non-uniform compilability classes, denoted as (p-^C, where C is a generic uniform complexity 
class, such as P, NP, coNP, or £3. 

Definition 5 (Ip-^C classes) A language of pairs S C X* x X* belongs to |p^C iff there 
exists a binary poly-size function f and a language of pairs S' G C such that for all (x, y) G S 
it holds: 

(f(x,\v\),v) e S' iff{x,y)eS 

Notice that the poly-size function / takes as input both x (the KB) and the size of y 
(either the formula or the interpretation). This is done for technical reason, that is, such 
assumption allows obtaining results that are impossible to prove if the function / only 
takes x as input (Cadoli et al., 1996b). Such assuption is useful for proving negative results, 
that is, theorems of impossibility of compilation: indeed, if it is impossible to reduce the 
complexity of a problem using a function that takes both x and \y\ as input, then such 
reduction is also impossible using a function taking x only as its argument. 

Theorem 1 (Cadoli, Donini, Liberatore, & Schaerf, 1997, Theorem 6) Let C be 

a class in the polynomial hierarchy and S C X* x X*. A problem S belongs to |p->C if 
and only if there exists a poly-size function f and a language of pairs S' , such that for all 
(x, y) G X* x X* it holds that: 

1. for all y such that \y\ < k, (f(x,k),y) G S' if and only if (x,y) G S; 

2. S' G C. 

Clearly, any problem whose time complexity is in C is also in ||^C: just take f(x, \y\) = x 
and S' = S. What is interesting is that some problem in C may belong to \{^C with 
C C C, e.g.,, some problems in NP are in |p-*P. This is true for example for some problems 
in belief revision (Cadoli et al., 1999). In the rest of this paper, however, we mainly focus 
on "complete" problems, defined below. A pictorial representation of the class ||^C is in 
Figure 1, where we assume that S' G C. 

For the problem pli no method proving that it belongs to ||^P is known. In order 
to show that it (probably) does not belong to ||^P, we define a notion of reduction and 
completeness. 

Definition 6 (Non-uniform comp-reducibility) Given two problems A and B, A is 
non-uniformly comp-reducible to B ( denoted as A < n u-comp B) iff there exist two poly-size 
binary functions f\ and fi, and a polynomial-time binary function g such that for every 
pair (x,y) it holds that (x,y) G A if and only if {fi(x,\y\),g(f 2 (x,\y\),y)) G B. 

The <nu-comp reductions can be represented as depicted in Figure 2. Such reductions 
satisfy all important properties of a reduction. 

Theorem 2 (Cadoli et al., 1996b, Theorem 5) The reductions <nu-comp satisfy tran- 
sitivity and are compatible (Johnson, 1990) with the class \\^C for every complexity class C. 
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X 



S' 



(x,y) € S 



Figure 1: A representation of |p-»C. 



Figure 2: The nu-comp-C reductions. 



Therefore, it is possible to define the notions of hardness and completeness for |f^C for 
every complexity class C. 

Definition 7 (|(^C-completeness) Let S be a language of pairs and C a complexity class. 
S is ||-^C-hard iff for all problems A € |p-»C we have that A < n u-comp S. Moreover, S is 
|^C-complete if S is in \\^C and is |^C-hard. 

We now have the right complexity class to completely characterize the problem pli. In 
fact pli is ||^coNP-complete (Cadoli et al., 1996b, Theorem 7). Furthermore, the hierarchy 
formed by the compilability classes is proper if and only if the polynomial hierarchy is 
proper (Cadoli et al., 1996b; Karp & Lipton, 1980; Yap, 1983) — a fact widely conjectured 
to be true. 

Informally, we may say that ||-^NP-hard problems are "not compilable to P", as from 
the above considerations we know that if there exists a preprocessing of their fixed part that 
makes them on-line solvable in polynomial time, then the polynomial hierarchy collapses. 
The same holds for ||^coNP-hard problems. In general, a problem which is I^C-complete 
for a class C can be regarded as the "toughest" problem in C, even after arbitrary prepro- 
cessing of the fixed part. On the other hand, a problem in |p-»C is a problem that, after 
preprocessing of the fixed part, becomes a problem in C (i.e., it is "compilable to C"). 

We close the section by giving another example of use of the compilability classes through 
the well-known formalism of Circumscription (McCarthy, 1980). Let x be any propositional 
formula. The minimal models of x are the truth assignments satisfying x having as few 
positive values as possible (w.r.t. set containment). The problem we consider is: check 
whether a given model is a minimal model of a propositional formula. This problem, called 
Minimal Model checking (mmc), can be reformulated as the problem of model checking in 
Circumscription, which is known to be co-NP-complete (Cadoli, 1992). 
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If we consider the knowledge base x as given off-line, and the truth assignment y as 
given on-line, we obtain the following definition: 

MMC = {(x, y) | y is a minimal model of x } 

This problem can be shown to be ||^coNP-complete (Cadoli et al., 1996b, Theorem 13). 
Hence, it is very unlikely that it can be in |f^P; that is, it is very unlikely that there exists 
some off-line processing of the knowledge base, yielding (say) some data structure x', such 
that given y, it can now checked in polynomial time whether y is a minimal model of x. 
This, of course, unless x' has exponential size. This observation applies also when x' is a 
knowledge base in Propositional Logic, and led to the interpretation that Circumscription 
is more compact, or succint, than PL (Cadoli, Donini, & Schaerf, 1995; Gogic et al., 1995). 
Our framework allows to generalize these results for all PKR formalisms, as shown in the 
sequel. 

4. Reductions among KR Formalisms 

We now define the forms of reduction between PKR formalisms that we analyze in the 
following sections. A formula can always be represented as a string over an alphabet S, 
hence from now on we consider translations as functions transforming strings. 

Let F\ and F2 be two PKR formalisms. There exists a poly-size reduction from F\ to 
F2, denoted as / : F\ 1— > _F 2 , if / is a poly-size function such that for any given knowledge 
base KB in Fi, f(KB) is a knowledge base in F2. Clearly, reductions should be restricted 
to produce a meaningful output. In particular, we now discuss reductions that preserve the 
models of the original theory. 

The semantic approach by Gogic and collegues (1995) is that the models of the two 
knowledge bases must be exactly the same. In other words, if a knowledge base KB of the 
formalism F\ is translated into a knowledge base KB' of the formalism F2 , then M \=Fi KB 
if and only if M \=p 2 KB'. This approach can be summarized by: a reduction between 
formalisms Fi and F2 is a way to translate knowledge bases of F\ into knowledge bases of F2 , 
preserving their sets of models. While this semantics is intuitively grounded, it is very easy 
to show examples in which two formalisms that we consider equally space-efficient cannot be 
translated to each other. Let us consider for instance a variant of the propositional calculus 
in which the syntax is that formulas must be of the form x\ A F, where F is a regular 
formula over the variables X2, — Clearly, this formalism is able to represent knowledge 
in the same space than the propositional calculus (apart a polynomial factor). However, 
according to the definition, this formalism cannot be translated to propositional calculus: 
there is no knowledge base that is equivalent to KB = ->xi. Indeed, the only model of 
KB is 0, while any model of any consistent knowledge base of the modified propositional 
calculus contains X\. 

We propose a more general approach that can deal also with functions / that change 
the language of the KB. To this end, we allow for a translation qkb from models of KB to 
models of f(KB). We stress that, to be as general as possible, the translation may depend 
on KB — i.e., different knowledge bases may have different translations of their models. 
We want this translation easy to compute, since otherwise the computation of qkb could 
hide the complexity of reasoning in the formalism. However, observe that to this end, it is 
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not sufficient to impose that qkb is computable in polynomial time. In fact, once KB is 
fixed, its models could be trivially translated to models of f(KB) in constant time, using 
a lookup table. This table would be exponentially large, though; and this is what we want 
to forbid. Hence, we impose that qkb is a circuit of polynomial-size wrt KB. We still use 
a functional notation gKB(M) to denote the result of applying a model M to the circuit 
gKB- A formal definition follows. 

Definition 8 (Model Preservation) A poly-size reduction f : F\ i— ► F 2 satisfies model- 
preservation if there exists a polynomial p such that, for each knowledge base KB in F\ there 
exists a circuit gKB whose size is bounded by p(\K B\) , and such that for every interpretation 
M of the variables of KB it holds that M \= Fl KB iff g K B{M) |=p 2 f(KB). 

The rationale of model-preserving reduction is that the knowledge base KB of the first 
formalism F\ can be converted into a knowledge base f{KB) in the second one F2, and this 
reduction should be such that each model M in F\ can be easily translated into a model 
g K B{M) in F 2 . 

We require g to depend on KB, because the transformation /, in general, could take 
the actual form of KB into account. This happens in the following example of a model- 
preserving translation. 

Example 3 We reduce a fragment of skeptical default logic (Kautz & Selman, 1991) to 
circumscription with varying letters, using the transformation introduced by Etherington 
(1987). Let (D,W) be a prerequisite-free normal (PFN) default theory, i.e., all defaults 
are of the form where 7 is a generic formula. Let Z be the set of letters occurring in 
(D,W). Define Pd as the set of letters {a 7 |^ £ D}. The function f can be defined in 
the following way: f({D,W}) = CIRC(T; P D ; Z), where T = W U {a 7 = ^|a 7 € P D }, 
Pd are the letters to be minimized, and Z (the set of letters occurring in (D,W)) are 
varying letters. We show that f is a model-preserving poly-size reduction. In fact, given 
a set of PFN defaults D let go be a function such that for each interpretation M for Z , 
go(M) = M U {a 7 G Pd\M \= -17}. Clearly, f is poly-size, gr> can be realized by a circuit 
whose size is polynomial in \D\, and M is a model of at least one extension of (D,W) iff 
gr>{M) \= CIRC(T; Pd; Z). The dependence of g only on D stresses the fact that, in this 
case, the circuit g does not depend on the whole knowledge base (D,W), but just on D. 

Clearly, when models are preserved, theorems are preserved as well. A weaker form of 
reduction is the following one, where only theorems are preserved. Also in this case we 
allow theorems of KB to be translated by a "simple" circuit gxB to theorems of KB. 

Definition 9 (Theorem Preservation) A poly-size reduction f : F\ 1— ► F2 satisfies theorem- 
preservation if there exists a polynomial p such that, for each knowledge base KB in F\, 
there exists a circuit gKB whose size is bounded by p(\KB\), and such that for every formula 
ip on the variables of KB, it holds that KB ip iff f (KB) \-p 2 gKBiv)- 

The theorem-preserving reduction has a property similar to that of the model-preserving 
reduction, when the knowledge bases are used to represent theorems rather than models. 
Namely, a knowledge base KB is translated into another knowledge base f(KB) which can 
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be used to represent the same set of theorems. More precisely, we have that each theorem 
(p of KB is represented by a theorem gKBi}f) of f(KB). 

Winslett (1989) has shown an example of a reduction from updated knowledge bases to 
PL that is theorem-preserving but not model-preserving. Using Winslett's reduction, one 
could use the same machinery for propositional reasoning in the KB, both before and after 
the update (plus the reduction). Also the reduction shown in the previous Example 3 is 
theorem-preserving, this time g being the identity circuit. 

We remark that our definitions of reduction are more general than those proposed by 
Gogic and collegues (1995). In fact, these authors consider only a notion analogous to 
Definition 8. and only for the case when g is the identity — i.e., models in the two formalisms 
should be identical. By allowing a simple translation g between models Definition 8 covers 
more general forms of reductions preserving models, like the one of Example 3. 

5. Comparing the Space Efficiency of PKR Formalisms 

In this section we show how to use the compilability classes defined in Section 3 to compare 
the succinctness of PKR formalisms. 

Let F\ and F2 be two formalisms representing sets of models. We prove that any 
knowledge base a in F\ can be reduced, via a poly-size reduction, to a knowledge base (3 
in F2 satisfying model-preservation if and only if the compilability class of the problem of 
model checking (first input: KB, second input: interpretation) in F2 is higher than or equal 
to the compilability class of the problem of model checking in F\. 

Similarly, we prove that theorem-preserving poly-size reductions exist if and only if the 
compilability class of the problem of inference (first input: KB, second input: formula, cf. 
definition of the problem PLl) in F\ is higher than or equal to the compilability class of the 
problem of inference in F%. 

In order to simplify the presentation and proof of the theorems we introduce some 
definitions. 

Definition 10 (Model hardness/completeness) Let F be a PKR formalism and C be 
a complexity class. If the problem of model checking for F belongs to the compilability class 
If^C, where the model is the varying part of the instances, we say that F is in model-C. 
Similarly, if model checking is \\^C- complete (hard), we say that F is model-C -complete 
(hard). 

Definition 11 (Theorem hardness/completeness) Let F be a PKR formalism and 
C be a complexity class. If the problem of inference for the formalism F belongs to the 
compilability class whenever the formula is the varying part of the instance, we say 

that F is in thm-C. Similarly, if inference is \\^C-complete (hard), we say that F is thm- 
C-complete (hard). 

These definitions implicitly define two hierarchies, which parallel the polynomial hierar- 
chy (Stockmeyer, 1976): the model hierarchy (model-P,model-NP, model- T^, etc.) and the 
theorem hierarchy (thm-P,thm-NP,thm-S2,etc). The higher a formalism is in the model 
hierarchy, the more its efficiency in representing models is — and analogously for theorems. 
As an example (Cadoli et al., 1996, Thm. 6), we characterize model and theorem classes of 
Propositional Logic. 
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Theorem 3 PL is in model-P and it is thm-coNP -complete. 

We can now formally establish the connection between succinctness of representations 
and compilability classes. In the following theorems, the complexity classes C, Ci, C2 belong 
to the polynomial hierarchy (Stockmeyer, 1976). In Theorems 5 and 7 we assume that the 
polynomial hierarchy does not collapse. 

We start by showing that the existence of model-preserving reductions from a formalism 
to another one can be easily obtained if their levels in the model hierarchy satisfy a simple 
condition. 

Theorem 4 Let F\ and F2 be two PKR formalisms. If F\ is in model-C and F2 is model- 
C-hard, then there exists a poly-size reduction f : F± 1— ► F2 satisfying model preservation. 

Proof. Recall that since F\ is in model-C, model checking in F\ is in |(^C, and since F2 is 
model-C-hard, model checking in F\ is non-uniformly comp-reducible to model checking in 
F2. That is, (adapting Def. 6) there exist two poly-size binary functions f\ and f'2, and a 
polynomial-time binary function g such that for every pair {KB, M) it holds that 

M \= Fl KB if and only if g{f 2 (KB, |M|), M) h(KB,\M\) 

(note that g is the poly-time function appearing in Def. 6, different from gxB which is the 
poly-size circuit appearing in Def. 8). 

Now observe that \M\ can be computed from KB by simply counting the letters appear- 
ing in KB; let be such a counting function, i.e., \M\ = f%{KB). Clearly, f% is poly-size. 
Define the reduction / as f(KB) = fi(KB, f^(KB)). Since poly-size functions are closed 
under composition, / is poly-size. Now we show that / is a model-preserving reduction. By 
Definition 8, we need to prove that there exists a polynomial p such that for each knowledge 
base KB in F±, there exists a poly-size circuit gxB such that for every interpretation M of 
the variables of KB it holds that M \= Fl KB iff g K B(M) \= F , 2 f(KB). 

We proceed as follows: Given a KB in F\, we compute z = f 2 (KB, \M\) = f 2 (KB, f 3 (KB)). 
Since /2 and fs are poly-size, z has size polynomial with respect to \KB\. Define the circuit 
gKB(M) as the one computing g(z,M) = g(f2(KB, f^(KB)), M). Since g is a poly-time 
function over both inputs, and z is poly-size in KB, there exists a representation of g{z, M) 
as a circuit gKB whose size is polynomial wrt KB. From this construction, M \= Fl KB iff 
gKB(M) \= F , 2 f(KB). Hence, the thesis follows. □ 

The following theorem, instead, gives a simple method to prove that there is no model- 
preserving reduction from one formalism to another one. 

Theorem 5 Let F\ and F2 be two PKR formalisms. If the polynomial hierarchy does not 
collapse, F\ is model-Ci-hard, F2 is in model-C2, and C2 C Ci, then there is no poly-size 
reduction f : F\ 1— > F 2 satisfying model preservation. 

Proof. We show that if such a reduction exists, then Ci/poly C C2/poly which implies that 
the polynomial hierarchy collapses at some level (Yap, 1983). Let A be a complete problem 
for class Ci — e.g., if Ci is £3 then A may be validity of 3V3-quantified boolean formulae 
(Stockmeyer, 1976). Define the problem eA as follows. 

eA = {(x, y) I x = e (the empty string) and y G A} 
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We already proved (Cadoli et al., 1996b, Thm. 6) that eA is |f^Ci-complete. Since model 
checking in F\ is model-Ci-hard, eA is non-uniformly comp-reducible to model checking in 
F\. That is, (adapting Def. 6) there exist two poly-size binary functions f\ and f 2 , and 
a polynomial-time binary function g such that for every pair (e,y), it holds (e,y) € eA if 
and only if g(f 2 (e, \y\),y) \=Fi /i(e, M)- Let \y\ = n. Clearly, the knowledge base /i(e, \y\) 
depends only on n, i.e., there is exactly one knowledge base for each integer. Call it KB n . 
Moreover, /2(e, \y\) = /2(e,n) also depends on n only: call it O n (for Oracle). Observe that 
both KB n and O n have polynomial size with respect to n. 

If there exists a poly-size reduction / : F\ ^ F 2 satisfying model preservation, then given 
the knowledge base KB n there exists a poly-size circuit h n such that g(O n ,y) \=f 1 KB n if 
and only if h n (g(O n ,y)) \=f 2 f{KB n ). 

Therefore, the ||^Ci -complete problem eA can be non-uniformly reduced to a problem 
in |p-*C2 as follows: Given y, from its size \y\ = n one obtains (with a preprocessing) 
f(KB n ) and O n . Then one checks whether the interpretation h n (g(O n ,y)) (computable in 
polynomial time given n, y and O n ) is a model in F 2 for f(KB n ). From the fact that model 
checking in F 2 is in |p-*C2, we have that |f^Ci C |p^C2- We proved in a previous paper 
that such result implies that Ci/poly C C2/poly (Cadoli et al., 1996b, Thm. 9), which in 
turns implies that the polynomial hierarchy collapses (Yap, 1983). □ 

The above theorems show that the hierarchy of classes model-C exactly characterizes 
the space efficiency of a formalism in representing sets of models. In fact, two formalisms 
at the same level in the model hierarchy can be reduced into each other via a poly-size 
reduction (Theorem 4), while there is no poly-size reduction from a formalism (Fi) higher 
up in the hierarchy into one (F2) in a lower class (Theorem 5). In the latter case we say 
that F\ is more space- efficient than F 2 . 

Analogous results (with similar proofs) hold for poly-size reductions preserving theorems. 
Namely, the next theorem shows how to infer the existence of theorem-preserving reductions, 
while the other one gives a way to prove that there is no theorem-preserving reduction from 
one formalism to another one. 

Theorem 6 Let F\ and F 2 be two PKR formalisms. If F\ is in thm-C and F 2 is thm-C- 

hard, then there exists a poly-size reduction f : F± t— > F 2 satisfying theorem preservation. 

Proof. Recall that since F\ is in thm-C, inference in F\ is in If^C, and since F 2 is thm-C- 
hard, inference in F\ is non-uniformly comp-reducible to inference in F 2 . That is, (adapting 
Def. 6) there exist two poly-size binary functions f\ and f 2 , and a polynomial-time binary 
function g\ such that for every pair (KB, tp) it holds that 

KB h Fl ^ if and only if f,(KB, \<p\) h Fa g(f 2 (KB, \<p\), if) 

(here we distinguish the poly-time function g appearing in Def. 6 and the poly-size circuit 
9k B appearing in Def. 9). 

Using Theorem 1 we can replace \<p\ with an upper bound in the above formula. From 
Assumption 1, we know that the size of ip is less than or equal to the size of KB; therefore 
we replace \cp\ with \KB\. The above formula now becomes 

KB h Fl <p if and only if f±(KB, \KB\) h Fi g(f 2 (KB, \KB\), <p) 
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Define the reduction / as f(KB) = fa(KB, fa(KB)), where fa is the poly-size function 
that computes the size of its input. Since poly-size functions are closed under composition, 
/ is poly-size. 

Now, we show that / is a theorem-preserving reduction, i.e.,/ satisfies Def. 9. This 
amounts to proving that for each knowledge base KB in F\ there exists a circuit gxB, 
whose size is poynomial wrt KB, such that for every formula <p on the variables of KB it 
holds that KB h Fl 92 iff f(KB) h Fa g K B{<p)- 

We proceed as in the proof of Theorem 4: Given a KB in F\, let z = fa(KB, fa(KB)). 
Since fa and fa are poly-size, z has polynomial size with respect to Define gKB(f) = 

g(z, ip) = g(fa(KB, fa(KB)), ip). Clearly, gxB can be represented by a circuit of polynomial 
size wrt KB. From this construction, KB h Fl ip iff f(KB) \-p 2 gxBi'p)- Hence, the claim 
follows. □ 



Theorem 7 Let F\ and F2 be two PKR formalisms. If the polynomial hierarchy does not 
collapse, F\ is thm-Ci -hard, F2 is in thm-C2, and C2 C Ci, then there is no poly-size 
reduction f : F\ 1— > Fi satisfying theorem preservation. 

Proof. We show that if such a reduction exists, then Ci/poly C C2/poly and the polynomial 
hierarchy collapses at some level (Yap, 1983). Let A be a complete problem for class Ci. 
Define the problem eA as in the proof of Theorem 5: this problem is |p->Ci-complete (Cadoli 
et al., 1996b, Thm. 6). Since inference in F\ is thm-Ci-hard, eA is non-uniformly comp- 
reducible to inference in F\. That is, (adapting Def. 6) there exist two poly-size binary 
functions f\ and fa, and a polynomial-time binary function g such that for every pair (e, y), 
(e,y) € eA if and only if fa(e, \y\) \~ Fl g(fa(e, \y\),y). Let \y\ = n. Clearly, the knowledge 
base /i(e, |y|) depends just on n, i.e., there is one knowledge base for each integer. Call 
it KB n . Moreover, also fa{e, \y\) = fa(e,n) depends just on n: call it O n (for Oracle). 
Observe that both KB n and O n have polynomial size with respect to n. 

If there exists a poly-size reduction / : F\ 1— > F2 satisfying theorem preservation, then 
given the knowledge base KB n there exists a poly-time function h n such that KB n \-p 1 
g{O n ,y) if and only if f{KB n ) hp 2 h n (g(O n ,y)). 

Therefore, the ||^Ci-complete problem eA can be non-uniformly reduced to a problem in 
\\^C2 as follows: Given y, from its size \y\ = n one obtains (with an arbitrary preprocessing) 
f(KB n ) and O n . Then one checks whether the formula h n (g(O n ,y)) (computable in poly- 
time given y and O n ) is a theorem in F2 of f{KB n ). From the fact that inference in F2 is 
in |p->C2, we have that |p->Ci C |p->C2- It follows that Ci/poly C C2/poly (Cadoli et al., 
1996b, Thm. 9), which implies that the polynomial hierarchy collapses (Yap, 1983). □ 

Theorems 4-7 show that compilability classes characterize very precisely the relative 
capability of PKR formalisms to represent sets of models or sets of theorems. For example, 
as a consequence of Theorems 3 and 7 there is no poly-size reduction from PL to the 
syntactic restriction of PL allowing only Horn clauses that preserves the theorems, unless 
the polynomial hierarchy collapses. Kautz and Selman (1992) proved non-existence of such 
a reduction for a problem strictly related to pli using a specific proof. 
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6. Applications 

This section is devoted to the application of the theorems presented in the previous section. 
Using Theorems 4-7 and results previously known from the literature, we are able to asses 
model- and theorem-compactness of some PKR formalisms. 

We assume that definitions of Propositional Logic, default logic (Reiter, 1980), and 
circumscription (McCarthy, 1980) are known. Definitions of WIDTIO, SBR, GCWA, and 
stable model semantics are in the appropriate subsections. 

In the following proofs we refer to the problem 3V3QBF, that is, the problem of verifying 
whether a quantified Boolean formula BX^Y.^F is valid, where X and Y are disjoint sets of 
variables, and F is a set of clauses on the alphabet X U Y, each composed of three literals. 
As an example, a simple formula belonging to this class is: Ekci, x 2 V?/i, 2/2 V 2/2) A 

(-1X1 V ->x 2 V -iyi) A V -ix 2 V -iy 2 ) A (-1X1 V -1X2))- 

The problem of deciding validity of a 3V3QBF is complete for the class S^. As a conse- 
quence, the corresponding problem *3V3QBF, that is deciding whether an input composed 
of any string (*) as the fixed part and a quantified Boolean formula BAVY.-iF as the varying 
one, is complete for the class If^Sf (Liberatore, 1998). Notice that in most of the hardness 
proofs we show in the sequel we use problems without any meaningful fixed part. 

6.1 Stable Model Semantics 

Stable model semantics (SM) was introduced by Gelfond and Lifschitz (1988) as a tool to 
provide a semantics for logic programs with negation, their original proposal is now one 
of the standard semantics for logic programs. We now recall the definition of propositional 
stable model. 

Let P be a propositional, general logic program. Let M be a subset (i.e., an interpreta- 
tion) of the atoms of P. Let P M be the program obtained from P in the following way: if a 
clause C of P contains in its body a negated atom ->A such that A G M then C is deleted; 
if a body of a clause contains a negated atom -<A such that A M then -<A is deleted from 
the body of the clause. If M is a least Herbrand model of P M then M is a stable model of 
P. 

For the formalism SM, we consider the program P as the knowledge base. We write 
P Nsm Q to denote that query Q is implied by a logic program P under Stable Model 
semantics. 

In order to prove our result, we need to define the kernel of a graph. 

Definition 12 (Kernel) Let G = (V, E) be a graph. A kernel of G is a set K C V such 
that, denoting H = V — K , it holds: 

1. H is a vertex cover of G 

2. for all j G H, there exists an i G K such that (i,j) G E. 

We can now state the theorem on the compilability class of inference in the stable model 
semantics, and the corresponding theorem compactness class. 

Theorem 8 The problem of inference for the Stable Model semantics is ||-^>coNP- complete, 
thus Stable Model Semantics is thm-coNP complete. 
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Proof. Membership in the class follows from the fact that the problem is coNP-complete 
(Marek &; Truszczyhski, 1991). For the hardness, we adapt the proof of Marek and 
Truszczyhski (1991) showing that deciding whether a query is true in all stable models 
is coNP-hard. 

Let eKERNEL be the language {e, G} such that G is a graph with at least one kernel. 
Let |Cr| = n, and observe that G cannot have more vertices than its size n. 

We show that for each n, there exists a logic program P n such that for every graph G 
with at most n vertices, there exists a query Qg such that G has a kernel iff P n ^sm Qg- 

Let the alphabet of P n be composed by the following 2n 2 + n propositional letters: 
{a,i\i G {l..n} } U {r^, Sij\i,j G {l..n} }. 

The program P n is defined as: 

aj ■- ^ai,rij ) 

Sij :- ~>rij > for i,j G {l..n} 

Given a graph G = (V, E), the query Qg is defined as 

Q G = ( V ^) v ( V m) 

(i,j)eE (i,j)#E 

The reduction from eKERNEL to SM is defined as: f\(x,n) = P n , i.e., f\ depends only on 
its second argument, f2(x,n) = e, i.e., /2 is a constant function, and g = Q y , i.e., given a 
graph G, the circuit g computes the query Qg- 

As a result, this is a reduction. We now show that this reduction is correct, i.e., 
(e, G) G eKERNEL (G has a kernel) iff P n ^=sm Qg- 

If-part. Suppose P n \^sm Qg- Then, there exists a stable model M of P n such that 
M |= —>Qg- Observe that ^Qg is equivalent to the conjunction of all such that G E, 
and all -ij-jj such that 2" -E. Simplifying P n with -iQg w e obtain the clauses: 

aj :- -iCi, for G -E (1) 

Observe that M contains all such that (i,j) E 1 , and in order to be stable, — i.e., to 
support atoms r^j such that G E — M contains no atom such that G E. 

Let H = {j\dj G M}, K = {i|aj M}. Now i7 is a vertex cover of G, since for each 
edge G E, M should satisfy the corresponding clause (1) cij :— ->ai, hence either 

aj G M, or aj G M. Moreover, for each j in H, the atom aj is in M, and since M is a 
stable model, there exists a clause aj :— -iaj such that aj M, that is, i £ K. Therefore, 
K is a kernel of G. 

Only-if part. Suppose G = (V, E) has a kernel ET, and let H = V — K. Let M be the 
interpretation 

M = {rij\(i,j) G E} U j) £} U {a,[j G i?} 

Obviously, M \/= Qg- We now show that M is a stable model of P n , i.e., M is a least 
Herbrand model of Pjf - In fact, P^f contains the following clauses: 

8i j foT(i,j)#E (2) 

Tij ioi(i,j)€E (3) 

a j : ~ r ij for i G -ftT (4) 
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Clauses in the last line are obtained from clauses in P n of the form dj :— -iaj, r^-, where the 
clauses such that i € H (hence a, £ M) are deleted, while in the other clauses the negated 
atom -idj is deleted, since i £ K, hence a% M. Now for each £ M, the vertex j is in H, 
hence there is an edge € £7, and i £ K. Hence clauses (4) and (3) are in P^ 1 , hence in 
the least Herbrand model of P?f there are exactly all aj such that j £ H. □ 

6.2 Minimal Model Reasoning 

One of the most successful form of non-monotonic reasoning is based on the selection of 
minimal models. Among the various formalisms based on minimal model semantics we con- 
sider here Circumscription (McCarthy, 1980) and the Generalized Closed World Assumption 
(GCWA) (Minker, 1982), which is a formalism to represent knowledge in a closed world. 

We assume that the reader is familiar with Circumscription, we briefly present the 
definition of GCWA. The model semantics for GCWA is defined as (a is a letter): 

M \=gcwa KB iff M |= KB\J{->a | for any positive clause 7, if KB \f 7 then KB ^Va} 

We can now present the results for these two formalisms. 

Theorem 9 The problem of model checking for Circumscription is It^coNP- complete, thus 
Circumscription is model-coNP- complete. 

This result is a trivial corollary of a theorem already proved (Cadoli et al., 1997, The- 
orem 6). In fact, that proof implicitly shows that model checking for circumscription is 
|[^>coNP-complete. 

Theorem 10 The problem of model checking for GCWA is in |f^P, thus GCWA is in 
model-P. 

Proof. As already pointed out (Cadoli et al., 1997), it is possible to rewrite GCWA(T) 
into a propositional formula F such that, for any given model M, M \= GCW A{T) if and 
only if M \= F. Moreover, the size of F is polynomially bounded by the size of T. As a 
consequence, the model compactness for GCWA is in the same class of PL. By Theorem 3 
the thesis follows. □ 

Theorem 11 The problem of inference for Circumscription is If^II^- complete, thus Cir- 
cumscription is thm-U^-complete. 

This result is a trivial corollary of a theorem published in a previous paper (Cadoli 
et al., 1997, Theorem 7) which implicitly shows that inference for circumscription is |p-> rig- 
complete. 

Theorem 12 The problem of inference for GCWA is It^coNP -complete, thus GCWA is 
thm-coNP -complete. 

Proof. As already pointed out in the proof of Theorem 10, it is possible to rewrite 
GCW A(T) into a formula F that is equivalent to it. As a consequence, a formula a is 
a theorem of GCW A(T) if and only if it is a theorem of F. Thus, GCWA has at most the 
theorem compexity of PL. Since GCWA is a generalization of PL, it follows that GCWA is 
in the same theorem compactness class of PL. Hence, GCWA is thm-coNP-complete. □ 
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6.3 Default Logic 

In this subsection we present the results for default logic, in its two variants (credulous 
and skeptical). For more details on these two main variants of default logic, we refer the 
reader to the paper by Kautz and Selman (1991). Notice that model-compactness is only 
applicable to skeptical default logic. 

Theorem 13 The problem of model checking for skeptical default logic is j^Sf complete, 
thus skeptical default logic is model- complete. 

Proof. The proof of membership is straightforward: since model checking for skeptical 
default logic is in (Liberatore & Schaerf, 1998), it follows that it is also in Ip-^S^. 

The proof of I^X^-hardness is similar to the proof of X^-hardness (Liberatore & Schaerf, 
1998). The reduction is from the problem *3V3QBF. Let {a, (3) be an instance of *3V3QBF, 
where (3 = 3X\/Y.^F represents a valid 3V3QBF formula, and a is any string. 

Let n be the size of the formula F. This implies that the variables in the formula are at 
most n. Let T = {71, . . . , 7^} be the set of all the clauses of three literals over this alphabet. 
The number of clauses of three literals over an alphabet of n variables is less than 0(n 3 ), 
thus bounded by a polynomial in n. 

We prove that BXMY.^F is valid if and only if M is a model of some extension of {W, D), 
where 



W = 

D II { :Ci 1 I u I I ( : w A ( w — :wA ( w ^ ~^ x i) \ u [ : w A A 7 ,gr a -»• H 

7 ^ r l Ci ' -.Cj J W^Xi ■ W^^Xi J \ W 

M = {a I 7i € F} 

The set {cj | 1 < i < k} is a set of new variables, one-to-one with the elements of T. 
Note that W and D only depends on the size n of F, while M depends on F. As a result, 
this is a < n u-comp reduction. 

We now prove that the formula is valid if and only if M is a model of some extension 
of the default theory (W,D). This is similar to an already published proof (Liberatore & 
Schaerf, 1998). Consider an evaluation C\ of the variables {cj} and an evaluation X\ of the 
variables X. Let D' be the following set of defaults. 

jji _ 1 1 f -Cj} 1 , f : --Cj \ u , ■ f :wA(w^x t ) \ , , f : w A (w -» -jgi) 

This set of defaults has been chosen so that the set R of its consequences corresponds 
to the sets C\ and X\. Namely, we have: 



Cj € C\ iff i? |= q 

Ci^C\ iff i? (= -icj 

Xj G Xi iff i? |= w — > Xj 

Xi ^ X\ iff i? |= u; — > -iXj 
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Now, we prove that the consequences of this set of defaults are an extension of the 
default theory if and only if the QBF formula is valid. Since all defaults are semi-normal, 
we have to prove that: 

1. the set of consequences of D' is consistent; and 

2. no other default is applicable, that is, there is no other default whose precondition is 
consistent with R. 

Consistency of R follows by construction: assigning a to true for each a 6 C\, etc., we 
obtain a model of R. 

We have then to prove that no other default is applicable. If Cj £ C±, the default ^°^ ± 
is not applicable, and vice versa, if -iCj € C\, then — is not applicable. Moreover, none 

of the defaults ' wA ^^ Xi ^ , is applicable if Xi & Xi, because in this case w — > ->Xi £ R, thus 
-iw would follow (while w is a justification of the default). A similar statement holds for 
:wA( "*' } if Xi€XL 

:wA/\ 6r Ci^7i 

As a result, the only applicable default may be the last one, (recall that 

F is negated). This default is applicable if and only if, for the given evaluation of the Cj's 
and Xj's, the set of clauses is satisfiable. This amount to say: "there is an extension in 
which the last default is not applicable if and only if the QBF formula is valid". Now, 
if the last default is applicable, then M is not a model of the extension because w is the 
consequence of the last default while w ^ M. The converse also holds: if the last default is 
not applicable then M is a model of the default theory. 

As a result, the QBF is valid if and only if M is a model of the given default theory. □ 

Theorem 14 The inference problem for skeptical default logic is Ip^II^ complete, thus skep- 
tical default logic is thm-LT^ complete. 

Proof. Since inference in skeptical default logic is in LT^, it is also in If^II^. I^n^-hardness 
comes from a simple reduction from circumscription. Indeed, the circumscription of a 
formula T is equivalent to the conjunction of the extensions of the default theory (T,D), 
where (Etherington, 1987): 



As a result, CIRCiT) \= Q if and only if Q is implied by (T, D) under skeptical seman- 
tics. Since (T, D) only depends on T (and not on Q) this is a < n u~comp reduction. Since 
inference for circumscription is I^II^-complete (see Theorem 11), it follows that skeptical 
default logic is If^n^-hard. □ 

Theorem 15 The inference problem for credulous default logic is If^Ej complete, thus 
credulous default logic is thm-S^ complete. 
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Proof. The proof is very similar to the proof for model checking of skeptical default logic. 
Indeed, both problems are If^S^ complete. Since the problem is in E^, as proved by Gottlob 
(1992), it is also in If^E^. Thus, what we have to prove is that is hard for that class. 

We prove that the *3V3QBF problem can be reduced to the problem of verifying whether 
a formula is implied by some extensions of a default theory (that is, inference in credulous 
default logic). 

Namely, a formula VXBY.^F is valid if and only if Q is derived by some extension of 
the default theory (D,W), where W and D are defined as follows (r is the set of all the 
clauses of three literals over the alphabet of F, and C is a set of new variables, one-to-one 
with r). 



W = 



i v Cj 'C? J _ -* r I Xj '3/7 J HJ 



q = A Ci A A ^ A 



w 



Informally, the proof goes as follows: for each truth evaluation of the variables in C 
and X there is a set of defaults which are both justified and consistent. A simple necessary 
and sufficient condition for the consequences of this set of defaults to be an extension is the 
following. If, in this evaluation, the formula 

- A 7, 

Ci=true 

is valid, then the last default is applicable, thus the extension also contains w. The converse 
also holds: if the formula is not valid in the evaluation, then the variable w is not in the 
extension. 

As a result, there exists an extension in which Q holds if and only if there exists an 
extension in which each q is true if and only if 7, G F, and such that w also holds. When 
the variables q have the given value, the above formula is equivalent to ->F. As a result, 
such an extension exists if and only if there exists a truth evaluation of the variables X in 
which ->F is valid. □ 



6.4 Belief Revision 

Many formalisms for belief revision have been proposed in the literature, here we focus on 
two of them: WIDTIO (When In Doubt Throw it Out) and SBR (Skeptical Belief Revision) . 
Let K be a set of propositional formulae, representing an agent's knowledge about the world. 
When a new formula A is added to K, the problem of the possible inconsistency between K 
and A arises. The first step is to define the set of sets of formulae W(K, A) in the following 
way: 

W(K, A) = {K' I if' is a maximal consistent subset of K U {^4} containing A } 



22 



Space Efficiency of Propositional Knowledge Representation Formalisms 



Any set of formulae K' G W(K, A) is a maximal choice of formulae in K that are 
consistent with A and, therefore, we may retain when incorporating A. The definition of 
this set leads to two different revision operators: SBR and WIDTIO. 

SBR Skeptical Belief Revision (Fagin, Ullman, & Vardi, 1983; Ginsberg, 1986). The revised 
theory is defined as a set of theories: K * A = {K' | K' 6 W(K, A)}. Inference in the 
revised theory is defined as inference in each of the theories: 

K * AhsBR Q iff for all K' G W(K, A) , we have that K' h Q 

The model semantics is defined as: 

M \= S br K * A iff there exists a K' £ W(K, A) such that M \= K' 

WIDTIO When In Doubt Throw It Out (Winslett, 1990). A simpler (but somewhat 
drastical) approach is the so-called WIDTIO, where we retain only the formulae of K 
that belong to all sets of W(K, A). Thus, inference is defined as: 

K*Ah WIDTIO Q iff f\W{K,A)\-Q 

The model semantics of this formalism is defined as: 

M\= WIDTIO K*A iff M\=f]W(K,A) 

The results on model compactness have been shown by Liberatore and Schaerf (2000). 
Here we recall them. 

Theorem 16 (Liberatore &; Schaerf, 2000, Theorem 11) The problem of model check- 
ing for WIDTIO is in |^P, thus WIDTIO is in model-P. 

Theorem 17 (Liberatore &: Schaerf, 2000, Theorem 5) The problem of model check- 
ing for Skeptical Belief Revision is ||^coNP -complete, thus Skeptical Belief Revision is 
model-coNP -complete. 

The results on theorem compactness are quite simple and we provide here the proofs. 

Theorem 18 The problem of inference for WIDTIO is ||^coNP-comp/ete, thus WIDTIO 
is thm-coNP-comp/ete. 

Proof. Membership in the class thm-coNP immediately follows from the definition. In fact, 
we can rewrite K * A into a propositional formula by computing the set W(K, A) and then 
constructing their intersection. By construction their intersection has size less than or equal 
to the size of K U A. As a consequence, after preprocessing, deciding whether a formula Q 
follows from K * A is a problem in coNP. Hardness follows from the obvious fact that PL 
can be reduced to WIDTIO and PL is thm-coNP-complete (see Theorem 3). □ 

Theorem 19 The problem of inference for Skeptical Belief Revision is W^H^- complete, 
thus Skeptical Belief Revision is thm-H^-complete. 
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Time Complexity 


Space Efficiency 


Propositional 
Logic 


P 


mo del- P 


WTDTTO 


y~! - pa m n 1 p f p 

(Liberatore & Schaerf, 1996) 


mnrlpl-P 

Th. 16 


X k"PTii~ip£i 1 

Belief Revision 


pnlVP-pnrmrJptp 

(Liberatore & Schaerf, 1996) 


mpiri pl-PPtlMP-PPtrnTilp^p 

Th. 17 




ppil\TT-*-ppim"nj|p1~p 

(Cadoli, 1992) 


mpiri pl-PPtlMP-PPtrnTilp^p 

Th. 9 


GCWA 


coNP-hard 
in A^logn] 
(Eitcr & Gottlob, 1993) 


tdoHp-I-P 
Th. 10 


Skeptical 
Default Reasoning 


Sj-complete 
(Liberatore & Schaerf, 1998) 


model-£2" com pl e tc 
Th. 13 


Credulous 
Default Reasoning 


N/A 


N/A 


Stable Model 
Semantics 


P 


model-P 



Table 1: Complexity of model checking and Space Efficiency of Model Representations 

Proof. Membership follows from the complexity results of Eiter and Gottlob (1992), where 
they show that deciding whether K * A \~sbr Q is a LT^-complete problem. Hardness 
follows easily from Theorem 17. In fact, M \=sbr K * A iff K * A \/sbr ~ , form(M), where 
form(M) is the formula that represents the model M. As a consequence, model checking 
can be reduced to the complement of inference. Thus inference is H^LT^-complete. □ 

6.5 Discussion 

Tables 1 and 2 summarize the results on space efficiency of PKR formalisms and where they 
were proved (a dash "— " denotes a folklore result). 

First of all, notice that space efficiency is not always related to time complexity. As an 
example, we compare in detail WIDTIO and circumscription. From the table it follows that 
model checking is harder for WIDTIO than for circumscription, and that inference has the 
same complexity in both cases. Nevertheless, since circumscription is thm-S^-complete and 
WIDTIO is thm-coNP-complete (and thus in thm-Eg), there exists a poly-size reduction 
from WIDTIO to circumscription satisfying theorem preservation. The converse does not 
hold: since circumscription is thm-S^-complete and WIDTIO is thm-coNP, unless the Poly- 
nomial Hierarchy does not collapse there is no theorem-preserving poly-size reduction from 
the former formalism to the latter. Hence, circumscription is a more compact formalism 
than WIDTIO to represent theorems. Analogous considerations can be done for models. 
Intuitively, this is due to the fact that for WIDTIO both model checking and inference 
require a lot of work on the revised knowledge base alone — computing the intersection of 
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Time Complexity 


Space Efficiency 


Propositional 
Logic 


coNP-complete 
(Cook 1971) 


thm-coNP-complete 
(Cadoli et al 1996) 


WIDTIO 


ilj-complete 
(Eitcr Gottlob 1992) (Nebel 1998) 


thm-coNP-complete 
Th. 18 


Skeptical 
RpIipt Revision 


n^-complete 
(Eher Cottlob 1992) 


thm-II^-complete 
Th. 19 


Circumscription 


ilj-complete 
(Eitcr & Gottlob, 1993) 


thm-II^-complete 
Th. 11 


GCWA 


Il^-complete 
(Eitcr & Gottlob, 1993) & (Nebel, 1998) 


thm-coNP-complete 
Th. 12 


Skeptical 
Default Reasoning 


LT^-complete 
(Gottlob, 1992) 


thm-II^-complete 
Th. 14 


Credulous 
Default Reasoning 


S^-complete 
(Gottlob, 1992) 


thm-E2" com pl c te 
Th. 15 


Stable Model 
Semantics 


coNP-complete 
(Marek & Truszczyriski, 1991) 


thm-coNP-complete 
Th. 8 



Table 2: Complexity of inference and Space Efficiency of Theorem Representations 

all elements of W(K, A). Once this is done, one is left with model checking and inference in 
PL. Hence, WIDTIO has the same space efficiency as PL, which is below circumscription. 

Figures 3 and 4 contain the same information of Tables 1 and 2, but highlight exist- 
ing reductions. Each figure contains two diagrams, the left one showing the existence of 
polynomial-time reductions among formalisms, the right one showing the existence of poly- 
size reductions. An arrow from a formalism to another denotes that the former can be 
reduced to the latter one. We use a bidirectional arrow to denote arrows in both directions 
and a dashed box to enclose formalisms that can be reduced one into another. Note that 
some formalisms are more appropriate in representing sets of models, while others perform 
better on sets of formulae. An interesting relation exists between skeptical default reasoning 
and circumscription. While there is no model-preserving poly-size reduction from circum- 
scription to skeptical default reasoning (Gogic et al., 1995), a theorem-preserving poly-size 
reduction exists, as shown by Theorem 14. 

7. Related Work and Conclusions 

The idea of comparing the compactness of KR formalisms in representing information is not 
novel in AL It is well known that first-order circumscription can be represented in second- 
order logic (Schlipf, 1987). Kolaitis and Papadimitriou (1990) discuss several computational 
aspects of circumscription. Among many interesting results they show a reduction from a 
restricted form of first-order circumscription into first-order logic. The proposed reduction 
will increase the size of the original formula by an exponential factor. It is left as an open 
problem to show whether this increase is intrinsic, because of the different compactness 
properties of the two formalisms, or there exists a more space-efficient reduction. When a 
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Figure 3: Complexity of Model Checking vs. Space Efficiency of Model Representation 
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Figure 4: Complexity of Inference vs. Space Efficiency of Theorem Representation 
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first-order language is used, more results on compactness and existence of reductions are 
reported by Schlipf (1995). 

Khardon and Roth (1996, 1997), and Kautz, Kearns and Selman (1995) propose model- 
based representations of a KB in Propositional Logic, and compare it with formula-based 
representations. Although their results are significant for comparing representations within 
PL, they refer only to this formalism, hence they are not applicable to our comparison be- 
tween different PKR formalisms. The same comment applies also to the idea of representing 
a KB with an efficient basis by Moses and Tennenholz (1996), since it refers only to one 
PKR formalism, namely, PL. 

An active area of research studies the connections of the various non-monotonic logics. 
In particular, there are several papers discussing the existence of translations that are poly- 
nomial in time and satisfy other intuitive requirements such as modularity and faithfulness. 
Janhunen (1998), improving on results of Imielinski (1987) and Gottlob (1995), shows that 
default logic is the most expressive, among the non-monotonic logics examined, since both 
circumscription and autoepistemic logic can be modularly and faithfully embedded in de- 
fault logic, but not the other way around. While these results are of interest and help to 
fully understand the relation among many knowledge representation formalisms, they are 
not directly related to ours. In fact, we allow for translations that are more general than 
polynomial time, while in all of the above papers they only consider translations that use 
polynomial time and also satisfy additional requirements. 

The first result on compactness of representations for a propositional language is pre- 
sented, to the best of our knowledge, by Kautz and Selman (1992). They show that, unless 
there is a collapse in the polynomial hierarchy, the size of the smallest representation of 
the least Horn upper bound of a propositional theory is superpolynomial in the size of the 
original theory. These results are also presented in a different form in the more compre- 
hensive paper (Selman & Kautz, 1996). The technique used in the proof has been then 
used by us and other researchers to prove several other results on the relative complexity of 
propositional knowledge representation formalisms (Cadoli et al., 1996, 1997, 1999; Gogic 
et al., 1995). 

In a recent paper (Cadoli et al., 1996b) we introduced a new complexity measure, i.e., 
compilability. In this paper we have shown how this measure is inherently related to the 
succinctness of PKR formalisms. We analyzed PKR formalisms with respect to two succinct- 
ness measures: succinctness in representing sets of models and succinctness in representing 
sets of theorems. 

The main advantage of our framework is the machinery necessary for a formal way of 
talking about the relative ability of PKR formalisms to compactly represent information. In 
particular, we were able to formalize the intuition that a specific PKR formalism provides 
"one of the most compact ways to represent models / theorems" among the PKR formalisms 
of a specific class. 

In our opinion, the proposed framework improves over the state of the art in two different 
aspects: 

1. All the proofs presented in the previous papers only compare pairs of PKR for- 
malisms, for example propositional circumscription and Propositional Logic (Cadoli 
et al., 1997). These results do not allow for a precise classification of the level of 
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compactness of the considered formalisms. Rephrasing and adapting these results 
in our framework allows us to infer that circumscription is model-coNP-complete and 
thm-II^-complete. As a consequence, we also have that it is more space-efficient of the 
WIDTIO belief revision formalism in representing sets of models or sets of theorems. 

2. Using the proposed framework it is now possible to find criteria for adapting existent 
polynomial reductions showing C-hardness into reductions that show model-C or thm- 
C-hardness, where C is a class in the polynomial hierarchy (Liberatore, 1998). 
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